SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: HOGREFE, Holly 

HANSEN, Connie J 

(ii) TITLE OF INVENTION: Polymerase Enhancing Factor (PEF) 

Extracts, PEF Protein Complexes, Isolated PEF Proteins, 
and Methods for Purifying and Identifying Them 

(iii) NUMBER OF SEQUENCES: 89 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: David J. Kulik, Evenson, McKeown, Edwards & 
Lenahan P.L.L.C. 

(B) STREET: 1200 G Street, NW Suite 700 

(C) CITY: Washington 

(D) STATE: DC 
(F) ZIP: 20005 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 24-OCT-1997 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: KULIK, David J 

(B) REGISTRATION NUMBER: 36,576 

(C) REFERENCE/DOCKET NUMBER: 1486/43 163cp 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 202 628-8800 

(B) TELEFAX: 202 628-8844 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: N-terminal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

Xaa Xaa Leu His His Val Lys Leu Ile Tyr Ala Thr Xaa Xaa Xaa 
1 5 10 15 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: N-terminal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Xaa Xaa Xaa Pro Asp Trp Xaa Xaa Arg Xaa Glu Xaa Leu Xaa Xaa 
1 5 10 15 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO 
(v) FRAGMENT TYPE: N-terminal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Xaa Leu Leu His His Val Lys Leu Ile Tyr Ala Thr Lys Xaa Arg Xaa 
1 5 10 15 

Leu Val Gly Lys Xaa Ile Val Leu Ala Ile Pro Gly Xaa Xaa Ala Xaa 
20 25 30 

Xaa Xaa Xaa 
35 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: N-terminal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Xaa Xaa Xaa Pro Asp Trp Xaa Xaa Arg Xaa Glu Xaa Leu Xaa Glu Xaa 
1 5 10 15 

Xaa Xaa 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 






(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Xaa Tyr Asp Ala Val Ile Met Ala Ala Ala Val Val Asp Phe Arg Pro 



Lys 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Ala Asp Leu Val Val Gly Asn Thr Leu Glu Ala Phe Gly Ser Glu Glu 
1 5 10 15 

Asn Gln Val Val Leu Ile Gly Arg 
20 

(2) INFORMATION FOR SEQ ID NO: 7: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
(v) FRAGMENT TYPE: N-terminal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Gly Ala Met Leu His His Val Lys Leu Ile Tyr Ala Xaa Lys Leu Arg 
1 5 10 15 




Lys 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
(v) FRAGMENT TYPE: N-terminal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Gly Ala Met Leu His His Val Lys Leu Ile Tyr Ala Thr Lys Xaa Xaa 
1 5 10 15 

Arg Lys 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
(v) FRAGMENT TYPE: N-terminal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Met Leu His His Val Lys Leu Ile Tyr Ala Thr Lys Leu 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: N-terminal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Gly Xaa Xaa Xaa Pro Asp Trp Xaa Xaa Lys Phe Arg Lys Glu Glu Ser 
1 5 10 15 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: N-terminal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Gly Ala Ile Leu Leu Pro Asp Trp Lys Ile Arg Lys Glu Ile Leu Ile 
1 5 10 15 

Glu 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: N-terminal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Xaa Met His His Val Ile Lys Leu Xaa Tyr Ala Thr Xaa Ser Arg Lys 
1 5 10 15 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: N-terminal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Met Leu Tyr Leu Val Arg Pro Asp Trp Lys Arg Arg Lys Glu Ile Leu 
1 5 10 15 

Ile Glu 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

CAYCAYGAHA ARYTHATTTA CGC 23 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 base pairs 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15 
GCCATDATNA CDGCRTCGTA TTT 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 
CAYCAYGAHA ARYTHATATA CGC 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 
ARDACDACYT GRTTTTCTTC 




(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1209 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

ATGCTTCACC ACGTCAAGCT AATCTACGCC ACAAAAAGTC GAAAGCTAGT TGGAAAAAAG 60 

ATAGTCNNNN NNNNNCCAGG GAGTATTGCG GCTTTGGATG TGAAAGCTTG TGAGGGACTA 120 

ATTAGGCATG GGGCCGAAGT TCATGCAGTG ATGAGTGAGG CAGCCACCAA GATAATTCAT 180 

CCTTATGCAT GGAATTTGCC CACGGGAAAT CCAGTCATAA CTGAGATCAC TGGATTTATC 240 

GAGCATGTTG AGTTAGCAGG GGAACATGAG AATAAAGCAG ATTTAATTTT GGTTTGTCCT 300 

GCCACTGCCA ACACAATTAG TAAGATTGCA TGTGGAATAG ATGATACTCC AGTAACTACA 360 

GTCGTGACCA CAGCATTTCC CCACATTCCA ATTATGATAG CCCCAGCAAT GCATGAGACA 420 

ATGTACAGGC ATCCCATAGT AAGGGAGAAC ATTGAAAGGT TAAAGAAGCT TGGCGTTGAG 480 

TTTATAGGAC CAAGAATTGA GGAGGGAAAG GCAAAAGTTG CAAGCATTGA TGAAATAGTT 540 

TACAGAGTTA TTAAAAAGCT CCACAAAAAA ACATTGGAAG GGAAGAGAGT CCTAGTAACG 600 

GCGGGAGCAA CAAGAGAGTA CATAGATCCA ATAAGATTCA TAACAAATGC CAGCAGTGGA 660 

AAAATGGGAG TAGCGTTGGC TGAAGAAGCA GATTTTAGAG GAGCTGTTAC CCTCATAAGA 720 

ACAAAGGGAA GTGTAAAGGC TTTTAGAATC AGAAAAATCA AATTGAAGGT TGAGACAGTG 780 

GAAGAAATGC TTTCAGCGAT TGAAAATGAG TTGAGGAGTA AAAAGTATGA CGTAGTTATT 840 

ATGGCAGCTG CTGTAAGCGA TTTTAGGCCA AAAATTAAAG CAGAGGGAAA AATTAAAAGC 900 

GGAAGATCAA TAACGATAGA GCTCGTTCCN NNNAATCCCA AAATCATTGA TAGAATAAAG 960 

GAAATTCAAC CAAATGTCTT TCTTGTTGGA TTTAAAGCAG AAACTTCAAA AGAAAAGCTT 1020 

ATAGAAGAAG GTAAAAGGCA GATTGAGAGG GCCAAGGCTG ACTTAGTCGT TGGTAACACA 1080 

TTGGAAGCCT TTGGAAGCGA GGAAAACCAA GTAGTATTAA TTGGCAGAGA TTTCACAAAA 1140 




GAACTTCCAA AAATGAAAAA GAGAGAGTTA GCAGAGAGAA TTTGGGATGA GATAGAGAAA 1200 
TTNCTGTCC 1209 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 403 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Met Leu His His Val Lys Leu Ile Tyr Ala Thr Lys Ser Arg Lys Leu 
1 5 10 15 

Val Gly Lys Lys Ile Val Xaa Xaa Xaa Pro Gly Ser Ile Ala Ala Leu 
20 25 30 

Asp Val Lys Ala Cys Glu Gly Leu Ile Arg His Gly Ala Glu Val His 
35 40 45 

Ala Val Met Ser Glu Ala Ala Thr Lys Ile Ile His Pro Tyr Ala Trp 
50 55 60 

Asn Leu Pro Thr Gly Asn Pro Val Ile Thr Glu Ile Thr Gly Phe Ile 
65 70 75 80 

Glu His Val Glu Leu Ala Gly Glu His Glu Asn Lys Ala Asp Leu Ile 
85 90 95 

Leu Val Cys Pro Ala Thr Ala Asn Thr Ile Ser Lys Ile Ala Cys Gly 
100 105 110 

Ile Asp Asp Thr Pro Val Thr Thr Val Val Thr Thr Ala Phe Pro His 
115 120 125 

Ile Pro Ile Met Ile Ala Pro Ala Met His Glu Thr Met Tyr Arg His 
130 135 140 

Pro Ile Val Arg Glu Asn Ile Glu Arg Leu Lys Lys Leu Gly Val Glu 
145 150 155 160 

Phe Ile Gly Pro Arg Ile Glu Glu Gly Arg Ala Lys Val Ala Ser Ile 
165 170 175 

Asp Glu Ile Val Tyr Arg Val Ile Lys Lys Leu His Lys Lys Thr Leu 
180 185 190 

Glu Gly Lys Arg Val Leu Val Thr Ala Gly Ala Thr Arg Glu Tyr Ile 
195 200 205 

Asp Pro Ile Arg Phe Ile Thr Asn Ala Ser Ser Gly Lys Met Gly Val 
210 215 220 

Ala Leu Ala Glu Glu Ala Asp Phe Arg Gly Ala Val Thr Leu Ile Arg 
225 230 235 240 

Thr Lys Gly Ser Val Lys Ala Phe Arg Ile Arg Lys Ile Lys Leu Lys 
245 250 255 

Val Glu Thr Val Glu Glu Met Leu Ser Ala Ile Glu Asn Glu Leu Arg 
260 265 270 

Ser Lys Lys Tyr Asp Val Val Ile Met Ala Ala Ala Val Ser Asp Phe 
275 280 285 

Arg Pro Lys Ile Lys Ala Glu Gly Lys Ile Lys Ser Gly Arg Ser Ile 
290 295 300 

Thr Ile Glu Leu Val Pro Xaa Asn Pro Lys Ile Ile Asp Arg Ile Lys 
305 310 315 320 

Glu Ile Gln Pro Asn Val Phe Leu Val Gly Phe Lys Ala Glu Thr Ser 
325 330 335 

Lys Glu Lys Leu Ile Glu Glu Gly Lys Arg Gln Ile Glu Arg Ala Lys 
340 345 350 

Ala Asp Leu Val Val Gly Asn Thr Leu Glu Ala Phe Gly Ser Glu Glu 
355 360 365 

Asn Gln Val Val Leu Ile Gly Arg Asp Phe Thr Lys Glu Leu Pro Lys 
370 375 380 

Met Lys Lys Arg Glu Leu Ala Glu Arg Ile Trp Asp Glu Ile Glu Lys 
385 390 395 400 

Xaa Leu Ser 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20 
CATAGCGAAT TCGCAAAACC TTTCGCGGTA TGG 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21 
ACTACGGAAT TCCACGGAAA ATGCCGCTCA TCC 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 
GGCGTTTCCG TTCTTCTTCG 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 
CCATCTCACG CGCCAGTTTC 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 
GAGGAGAGCA GGAAAGGTGG AAC 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25 
GCTGGGAGAA GACTTCACTG G 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26 
GAGCTTGCTC AACTTTATC 
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27 
GATAGAGATA GTTTCTGGAG ACG 
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28 

CGGGATATCG ACATTTCTGC ACC 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29 
GAGTTAAATG CCTACACTGT ATCT 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30 
CAGGACTCAG AAGCTGCTAT CGAA 
(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31 
CTGCACGTGC CCTGTAGGAT TTGT 




(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32 
CCAGAYTGGA ARWKNAGGAA AGA 
(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33 
CCAGAYTGGA ARWKNAGAAA AGA 



(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
CCAGAYTGGA ARWKNAGGAA GGA 
(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
CCAGAYTGGA ARWKNAGAAA GGA 
(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

CAGAGTGGGC AGAGAGGCTN TTGTTAAGGG GAAATTAATC GACGTGGAAA AGGAAGGAAA 60 

AGTCGNTATT CCTCCAAGGG AATA 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: peptide 







(iii) HYPOTHETICAL: YES 
(iv) ANTI-SENSE: NO 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

Glu Trp Ala Glu Arg Leu Leu Leu Arg Gly Asn Xaa Ser Lys Trp Lys 
1 5 10 15 

Arg Lys Glu Lys Ser Xaa Phe Leu Gln Gly Asn 
20 25 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

Arg Val Gly Arg Glu Ala Xaa Val Lys Gly Lys Leu Ile Glu Val Glu 
1 5 10 15 

Lys Glu Gly Lys Val Xaa Ile Pro Pro Arg Glu 
20 25 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: YES 
(iv) ANTI-SENSE: NO 



(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

Gln Ser Gly Gln Arg Gly Xaa Cys Xaa Gly Glu Ile Asn Arg Ser Gly 
1 5 10 15 

Lys Gly Arg Lys Ser Arg Tyr Ser Ser Lys Gly Leu 
20 25 



(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 129 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
CTGCCCACTC TGAGGTCATA ACCTGCTGGT TGGAGCCATT CTTCAGAAAA TGGCTCTATA 60 
AGTATTTCTT TTCTGATTTT CCAGTCTGGA AGTAGCATTT TACCACCGAA ACCTTTATTT 120 
TTAATTTAA 129 
(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: N-terminal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:41: 

Xaa Ile Lys Asn Lys Gly Phe Gly Gly Lys Met Leu Leu Pro Asp Trp 
1 5 10 15 

Lys Ile Arg Lys Glu Ile Leu Ile Glu Pro Phe Ser Glu Glu Trp Leu 
20 25 30 

Gln Pro Ala Gly Tyr Asp Leu Arg Val Gly 
35 40 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 740 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

TCCTCCAAGG GAATACGCCT TAATCCTAAC CCTCGAGAGG ATAAAGTTGC CCGACGATGT 60 

TATGGGGGAT ATGAAGATAA GGAGCAGTTT AGCAAGAGAA GGGGTTATTG GTTCTTTTGC 120 

TTGGGTTGAC CCAGGATGGG ATGGAAACTT AACACTAATG CTCTACAATG CCTCAAATGA 180 

ACCTGTCGAA TTAAGATATG GAGAGAGATT TGTGCAGATC GCATTTATAA GGCTAGAGGG 240 

TCCGGCAAGA AACCCTTACA GAGGAAACTA TCAGGGGAGC ACAAGGTTAG CGTTTTCAAA 300 

GAGAAAGAAA CTCTAGCGTC TTTTCAATAG CATCCTCAAT ATCTCGTGTG AAGTAATCAA 360 

TGTAAATACT TGCTGGGTGG GTTTTTAGGG ATTCAAACTC GTAAGATGGG CCTGTATAGC 420 

AGAAAACTAT TTTTGCCTCT TCTTCATTTA TCTTTCTGTG AATAAAAAAT CCAACATCCA 480 

CACTAGTTCC AAAAGATATT GTTTGCGTGA TTACCAACAA GATCTTGGCA TTATTTTTGA 540 

TCTTATACTC TATTCTCCTT TCTCCCTCCA ATTTGCCCAA AATAAACCTG GGTAGTATAC 600 

ATTCACTCCT CTCTTTTAAA TTCCTATAAA TTCGTACATA GTTTAGAAAA ATGTCAAATT 660 

CTTTNTTCCC TGTTAAATTA ACCNCNAAAT CTTTATNANN AANCTTTTTA TAATTCCCAA 720 

AACCCCTAAT TTTCCCCTTN 740 
(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 246 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 



(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: YES 
(iv) ANTI-SENSE: NO 
(v) FRAGMENT TYPE: N-terminal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

Leu Gln Gly Asn Thr Pro Xaa Ser Xaa Pro Ser Arg Gly Xaa Ser Cys 
1 5 10 15 

Pro Thr Met Leu Trp Gly Ile Xaa Arg Xaa Gly Ala Val Xaa Gln Glu 
20 25 30 

Lys Gly Leu Leu Val Leu Leu Leu Gly Leu Thr Gln Asp Gly Met Glu 
35 40 45 

Thr Xaa His Xaa Cys Ser Thr Met Pro Gln Met Asn Leu Ser Asn Xaa 
50 55 60 

Asp Met Glu Arg Asp Leu Cys Arg Ser His Leu Xaa Gly Xaa Arg Val 
65 70 75 80 

Arg Gln Glu Thr Leu Thr Glu Glu Thr Ile Arg Gly Ala Gln Gly Xaa 
85 90 95 

Arg Phe Gln Arg Glu Arg Asn Ser Ser Val Phe Ser Ile Ala Ser Ser 
100 105 110 

Ile Ser Arg Val Lys Xaa Ser Met Xaa Ile Leu Ala Gly Trp Val Phe 
115 120 125 

Arg Asp Ser Asn Ser Xaa Asp Gly Pro Val Xaa Gln Lys Thr Ile Phe 
130 135 140 

Ala Ser Ser Ser Phe Ile Phe Leu Xaa Ile Lys Asn Pro Thr Ser Thr 
145 150 155 160 

Leu Val Pro Lys Asp Ile Val Cys Val Ile Thr Asn Lys Ile Leu Ala 
165 170 175 

Leu Phe Leu Ile Leu Tyr Ser Ile Leu Leu Ser Pro Ser Asn Leu Pro 
180 185 190 

Lys Ile Asn Leu Gly Ser Ile His Ser Leu Leu Ser Phe Lys Phe Leu 
195 200 205 

Xaa Ile Arg Thr Xaa Phe Arg Lys Met Ser Asn Ser Xaa Phe Pro Val 
210 215 220 

Lys Leu Thr Xaa Lys Ser Leu Xaa Xaa Xaa Phe Leu Xaa Phe Pro Lys 
225 230 235 240 

Pro Leu Ile Phe Pro Xaa 
245 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 246 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

Pro Pro Arg Glu Tyr Ala Leu Ile Leu Thr Leu Glu Arg Ile Lys Leu 
1 5 10 15 

Pro Asn Asn Val Met Gly Asp Met Lys Ile Arg Ser Ser Leu Ala Arg 
20 25 30 

Glu Gly Val Ile Gly Ser Phe Ala Trp Val Asp Pro Gly Trp Asp Gly 
35 40 45 

Asn Leu Thr Leu Met Leu Tyr Asn Ala Ser Asn Glu Pro Val Glu Leu 
50 55 60 

Arg Tyr Gly Glu Arg Phe Val Gln Ile Ala Phe Ile Arg Leu Glu Gly 
65 70 75 80 

Pro Ala Arg Asn Pro Tyr Arg Gly Asn Tyr Gln Gly Ser Thr Arg Leu 
85 90 95 

Ala Phe Ser Lys Arg Lys Lys Leu Xaa Arg Leu Phe Asn Ser Ile Leu 
100 105 110 

Asn Ile Ser Cys Glu Val Ile Asn Val Asn Thr Cys Trp Val Gly Phe 
115 120 125 

Xaa Gly Phe Lys Leu Val Arg Trp Ala Cys Ile Ala Glu Asn Tyr Phe 
130 135 140 

Cys Leu Phe Phe Ile Tyr Leu Ser Val Asn Lys Lys Ser Asn Ile His 
145 150 155 160 

Thr Ser Ser Lys Arg Tyr Cys Leu Arg Asp Tyr Gln Gln Asp Leu Gly 
165 170 175 

Ile Ile Phe Asp Leu Ile Leu Tyr Ser Pro Phe Ser Leu Gln Phe Ala 
180 185 190 

Gln Asn Lys Pro Gly Xaa Tyr Thr Phe Thr Pro Leu Phe Xaa Ile Pro 
195 200 205 

Ile Asn Ser Tyr Ile Val Xaa Lys Asn Val Lys Phe Phe Xaa Pro Cys 
210 215 220 

Xaa Ile Asn Xaa Xaa Ile Phe Xaa Xaa Xaa Leu Phe Ile Ile Pro Lys 
225 230 235 240 

Thr Pro Asn Phe Pro Leu 
245 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 246 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

Ser Ser Lys Gly Ile Arg Leu Asn Pro Asn Pro Arg Glu Asp Lys Val 
1 5 10 15 

Ala Arg Arg Cys Tyr Gly Gly Tyr Glu Asp Lys Glu Gln Phe Ser Lys 
20 25 30 

Arg Arg Gly Tyr Trp Phe Phe Cys Leu Gly Xaa Pro Arg Met Gly Trp 
35 40 45 

Lys Leu Asn Thr Asn Ala Leu Gln Cys Leu Lys Xaa Thr Cys Arg Ile 
50 55 60 

Lys Ile Trp Arg Glu Ile Cys Ala Asp Arg Ile Tyr Lys Ala Arg Gly 
65 70 75 80 

Ser Gly Lys Lys Pro Leu Gln Arg Lys Leu Ser Gly Glu His Lys Val 
85 90 95 

Ser Val Phe Lys Glu Lys Glu Thr Leu Ala Ser Phe Gln Xaa His Pro 
100 105 110 

Gln Tyr Leu Val Xaa Ser Asn Gln Cys Lys Tyr Leu Leu Gly Gly Phe 
115 120 125 

Leu Gly Ile Gln Thr Arg Lys Met Gly Leu Tyr Ser Arg Lys Leu Phe 
130 135 140 

Leu Pro Leu Leu His Leu Ser Phe Cys Glu Xaa Lys Ile Gln His Pro 
145 150 155 160 

His Xaa Phe Gln Lys Ile Leu Phe Ala Xaa Leu Pro Thr Arg Ser Trp 
165 170 175 

His Tyr Phe Xaa Ser Tyr Thr Leu Phe Ser Phe Leu Pro Pro Ile Cys 
180 185 190 

Pro Lys Xaa Thr Trp Val Val Tyr Ile His Ser Ser Leu Leu Asn Ser 
195 200 205 

Tyr Lys Phe Val His Ser Leu Glu Lys Cys Gln Ile Leu Xaa Ser Leu 
210 215 220 

Leu Asn Xaa Pro Xaa Asn Leu Tyr Xaa Xaa Xaa Phe Tyr Asn Ser Gln 
225 230 235 240 

Asn Pro Xaa Phe Ser Pro 
245 

(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
(v) FRAGMENT TYPE: N-terminal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

Met Leu His His Val Lys Leu Ile Tyr Ala Thr Lys Ser Arg Lys Leu 
1 5 10 15 

Val Gly Lys Lys Ile Val Xaa Xaa Xaa Pro Gly Ser Ile Ala Ala 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

Lys Tyr Asp Val Val Ile Met Ala Ala Ala Val Ser Asp Phe Arg Phe 
1 5 10 15 

Lys 




(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

Ala Asp Leu Val Val Gly Asn Thr Leu Glu Ala Phe Gly Ser Glu Glu 
1 5 10 15 

Asn Gln Val Val Leu Ile Gly Arg 
20 

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 
CTATTGAGTA CGAACGCCAT C 21 
(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
GTCACGCTTG CTCCACTCCG 20 
(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 437 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Methanococcus jannaschii 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

Met Ile Ser Glu Ile Met His Pro Thr Lys Leu Leu Lys Gly Thr Lys 
1 5 10 15 

Ser Lys Leu Leu Glu Asn Lys Lys Ile Leu Val Ala Val Thr Ser Ser 
20 25 30 

Ile Ala Ala Ile Glu Thr Pro Lys Leu Met Arg Glu Leu Ile Arg His 
35 40 45 

Gly Ala Glu Val Tyr Cys Ile Ile Thr Glu Glu Thr Lys Lys Ile Ile 
50 55 60 

Gly Lys Glu Ala Leu Lys Phe Gly Cys Gly Asn Glu Val Tyr Glu Glu 
65 70 75 80 

Ile Thr Gly Xaa Xaa Xaa Xaa Xaa Asp Ile Glu His Ile Leu Leu Tyr 
85 90 95 

Xaa Xaa Xaa Xaa Asn Glu Cys Asp Cys Leu Leu Ile Tyr Pro Ala Thr 
100 105 110 

Ala Asn Ile Ile Ser Lys Ile Asn Leu Gly Ile Ala Asp Asn Ile Val 
115 120 125 

Asn Thr Thr Ala Leu Met Phe Phe Gly Asn Lys Pro Ile Phe Ile Val 
130 135 140 

Pro Ala Met His Glu Asn Met Phe Asn Xaa Xaa Ala Ile Lys Arg His 
145 150 155 160 

Ile Asp Lys Leu Lys Glu Lys Asp Lys Ile Tyr Ile Ile Ser Pro Lys 
165 170 175 

Phe Glu Glu Xaa Xaa Xaa Xaa Xaa Xaa Gly Lys Ala Lys Val Ala Asn 
180 185 190 

Ile Glu Asp Val Val Lys Ala Val Ile Glu Lys Ile Gly Asn Asn Leu 
195 200 205 

Lys Lys Glu Gly Asn Arg Val Leu Ile Leu Asn Gly Gly Thr Val Glu 
210 215 220 

Phe Ile Asp Lys Val Arg Val Ile Ser Asn Leu Ser Ser Gly Lys Met 
225 230 235 240 

Gly Val Ala Leu Ala Glu Ala Phe Cys Lys Glu Gly Phe Tyr Val Glu 
245 250 255 

Val Ile Thr Ala Met Gly Leu Glu Pro Pro Tyr Tyr Ile Lys Asn His 
260 265 270 

Lys Val Leu Thr Ala Lys Glu Met Leu Asn Lys Ala Ile Glu Xaa Xaa 
275 280 285 

Leu Xaa Ala Lys Asp Phe Asp Ile Ile Ile Ser Ser Ala Ala Ile Ser 
290 295 300 

Asp Phe Thr Val Glu Ser Xaa Phe Glu Gly Lys Leu Ser Ser Glu Glu 
305 310 315 320 

Glu Xaa Xaa Xaa Xaa Leu Ile Leu Lys Leu Lys Arg Xaa Asn Pro Lys 
325 330 335 

Val Leu Glu Glu Leu Arg Arg Ile Tyr Lys Asp Xaa Lys Val Ile Ile 
340 345 350 

Gly Phe Lys Ala Glu Tyr Asn Leu Asp Glu Lys Glu Leu Ile Asn Arg 
355 360 365 

Ala Lys Glu Arg Leu Asn Lys Tyr Asn Leu Asn Met Ile Ile Ala Asn 
370 375 380 

Asp Leu Ser Lys Xaa Xaa His Tyr Phe Gly Asp Asp Tyr Ile Glu Val 
385 390 395 400 

Tyr Ile Ile Thr Lys Tyr Glu Val Glu Lys Ile Ser Gly Ser Lys Lys 
405 410 415 

Xaa Glu Ile Ser Glu Arg Ile Val Glu Lys Val Lys Lys Leu Val Lys 
420 425 430 

Ser Xaa Xaa Xaa Xaa 
435 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 444 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Met Lys Ala Arg Gln Gln Lys Tyr Cys Asp Lys Ile Ala Asn Phe Trp 
1 5 10 15 

Cys His Pro Thr Gly Lys Ile Ile Met Ser Leu Ala Gly Lys Lys Ile 
20 25 30 

Val Leu Gly Val Ser Gly Gly Ile Ala Ala Tyr Lys Thr Pro Glu Leu 
35 40 45 

Val Arg Arg Leu Arg Asp Arg Gly Ala Asp Val Arg Val Ala Met Thr 
50 55 60 

Glu Ala Ala Lys Ala Phe Ile Thr Pro Leu Ser Leu Gln Ala Val Ser 
65 70 75 80 

Gly Tyr Pro Val Ser Asp Ser Leu Leu Asp Pro Ala Ala Glu Ala Ala 
85 90 95 

Met Gly His Ile Glu Leu Gly Xaa Xaa Xaa Xaa Lys Trp Ala Asp Leu 
100 105 110 

Val Ile Leu Ala Pro Ala Thr Ala Asp Leu Ile Ala Arg Val Ala Ala 
115 120 125 

Gly Met Ala Asn Asp Leu Val Ser Thr Ile Cys Leu Ala Thr Pro Xaa 
130 135 140 

Xaa Ala Pro Val Ala Val Leu Pro Ala Met Asn Gln Gln Met Tyr Arg 
145 150 155 160 

Ala Ala Ala Thr Gln His Asn Leu Glu Val Leu Ala Xaa Ser Arg Gly 
165 170 175 

Leu Leu Ile Trp Gly Pro Asp Ser Gly Ser Gln Ala Cys Gly Asp Ile 
180 185 190 

Gly Pro Gly Arg Xaa Xaa Asp Pro Leu Thr Ile Val Asp Met Ala Val 
195 200 205 

Ala His Phe Ser Pro Val Asn Asp Leu Lys His Leu Asn Ile Met Ile 
210 215 220 

Thr Ala Gly Pro Thr Arg Glu Pro Leu Asp Pro Val Arg Tyr Ile Ser 
225 230 235 240 

Asn His Ser Ser Gly Lys Met Gly Phe Ala Ile Ala Ala Ala Ala Ala 
245 250 255 

Arg Arg Gly Ala Asn Val Thr Leu Val Ser Gly Pro Val Ser Leu Pro 
260 265 270 

Thr Pro Pro Phe Val Lys Arg Val Asp Val Met Thr Ala Leu Glu Met 
275 280 285 

Glu Ala Ala Val Asn Xaa Xaa Ala Ser Val Gln Gln Gln Asn Ile Phe 
290 295 300 

Ile Gly Cys Ala Ala Val Ala Asp Tyr Arg Ala Ala Thr Val Ala Pro 
305 310 315 320 

Glu Lys Ile Lys Lys Gln Ala Thr Gln Gly Asp Glu Leu Thr Ile Lys 
325 330 335 

Met Val Lys Xaa Asn Pro Asp Ile Val Ala Gly Val Ala Ala Leu Lys 
340 345 350 

Asp His Arg Pro Tyr Val Val Gly Phe Ala Ala Glu Thr Asn Asn Xaa 
355 360 365 

Xaa Xaa Xaa Val Glu Glu Tyr Ala Arg Gln Lys Arg Ile Arg Lys Asn 
370 375 380 

Leu Asp Leu Ile Cys Ala Asn Asp Val Ser Gln Pro Thr Gln Gly Phe 
385 390 395 400 

Asn Ser Asp Asn Asn Ala Leu His Leu Phe Trp Gln Asp Gly Asp Lys 
405 410 415 

Val Leu Pro Leu Glu Arg Lys Glu Leu Leu Gly Gln Leu Leu Leu Asp 
420 425 430 

Glu Ile Val Thr Arg Tyr Asp Glu Lys Asn Arg Arg 
435 440 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53:

Xaa Gly Xaa Xaa Asp Xaa Xaa Xaa Xaa Gly Xaa Xaa Xaa Xaa
1 5 10

(2) INFORMATION FOR SEQ ID NO: 54:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO

(v) FRAGMENT TYPE: internal 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Phe Ala Trp Val Asp Pro Gly Trp Asp Gly Asn Thr Leu Met 
1 5 10

(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 




(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

Ala Gly Trp Ile Asp Ala Gly Phe Lys Gly Lys Ile Thr Leu
1 5 10

(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

Ser Ala Val His Asp Pro Gly Tyr Glu Gly Arg Pro Glu Tyr 
1 5 10

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

Pro Thr Ile Val Asp Ala Gly Phe Glu Gly Gln Leu Thr Ile
1 5 10

(2) INFORMATION FOR SEQ ID NO: 58: 




(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

Ala His Arg Ile Asp Pro Gly Trp Ser Gly Cys Ile Val Leu
1 5 10

(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 
GAGTTAAATG CCTACACTGT ATCT 
(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 



CAGGACTCAG AAGCTGCTAT CGAA 

(2) INFORMATION FOR SEQ ID NO: 61:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

CTGCACGTGC CCTGTAGGAT TTGT 

(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 
CTATTGAGTA CGAACGCCAT C 
(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63:

GTCACGCTTG CTCCACTCCG 20
(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

GAGGAGAGCA GGAAAGGTGG AAC 23 

(2) INFORMATION FOR SEQ ID NO: 65:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: YES



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 
GCTGGGAGAA GACTTCACTG G 21 
(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66:

His His Val Lys Leu Ile Tyr Ala
1 5 

(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67:

Lys Tyr Asp Ala Val Ile Met Ala
1 5 

(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68:

Glu Glu Asn Gln Val Val Leu
1 5 

(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69:

Pro Asp Trp Lys Ile Arg Lys Glu
1 5 

(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 471 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

ATGCTACTTC CAGACTGGAA AATCAGAAAA GAAATACTTA TAGAGCCATT TTCTGAAGAA 60 

TCGCTCCAAC CAGCAGGTTA TGACCTCAGA GTGGGCAGAG AGGCTTTTGT TAAGGGGAAA 120 

TTAATCGACG TGGAAAAGGA AGGAAAAGTC GTTATTCCTC CAAGGGAATA CGCCTTAATC 180 

CTAACCCTCG AGAGGATAAA GTTGCCCGAC GATGTTATGG GGGATATGAA GATAAGGAGC 240 

AGTTTAGCAA GAGAAGGGGT TATTGGTTCT TTTGCTTGGG TTGACCCAGG ATGGGATGGA 300 

AACTTAACAC TAATGCTCTA CAATGCCTCA AATGAACCTG TCGAATTAAG ATATGGAGAG 360 

AGATTTGTGC AGATCGCATT TATAAGGCTA GAGGGTCCGG CAAGAAACCC TTACAGAGGA 420 

AACTATCAGG GGAGCACAAG GTTAGCGTTT TCAAAGAGAA AGAAACTCTA G 471 

(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 156 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

Met Leu Leu Pro Asp Trp Lys Ile Arg Lys Glu Ile Leu Ile Glu Pro
1 5 10 15

Phe Ser Glu Glu Ser Leu Gln Pro Ala Gly Tyr Asp Leu Arg Val Gly
20 25 30

Arg Glu Ala Phe Val Lys Gly Lys Leu Ile Asp Val Glu Lys Glu Gly
35 40 45

Lys Val Val Ile Pro Pro Arg Glu Tyr Ala Leu Ile Leu Thr Leu Glu
50 55 60

Arg Ile Lys Leu Pro Asp Asp Val Met Gly Asp Met Lys Ile Arg Ser
65 70 75 80

Ser Leu Ala Arg Glu Gly Val Ile Gly Ser Phe Ala Trp Val Asp Pro
85 90 95

Gly Trp Asp Gly Asn Leu Thr Leu Met Leu Tyr Asn Ala Ser Asn Glu
100 105 110

Pro Val Glu Leu Arg Tyr Gly Glu Arg Phe Val Gln Ile Ala Phe Ile
115 120 125

Arg Leu Glu Gly Pro Ala Arg Asn Pro Tyr Arg Gly Asn Tyr Gln Gly
130 135 140 

Ser Thr Arg Leu Ala Phe Ser Lys Arg Lys Lys Leu 
145 150 155 

(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: YES 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 



Xaa Gly Xaa Xaa Asp Xaa Xaa Xaa Gly Xaa Xaa Xaa Xaa 
1 5 10



(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

Phe Ala Trp Val Asp Pro Gly Trp Asp Gly Asn Thr Leu Met 
1 5 10

(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

Ala Gly Trp Ile Asp Ala Gly Phe Lys Gly Lys Ile Thr Leu
1 5 10

(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: peptide
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75:

Ser Ala Val His Asp Pro Gly Tyr Glu Gly Arg Pro Glu Tyr
1 5 10

(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

Pro Thr Ile Val Asp Ala Gly Phe Glu Gly Gln Leu Thr Ile
1 5 10

(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 



Ala His Arg Ile Asp Pro Gly Trp Ser Gly Cys Ile Val Leu
1 5 10




(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

Val Gly Leu Ile Asp Ser Asp Tyr Gln Gly Gln Leu Met Ile
1 5 10

(2) INFORMATION FOR SEQ ID NO: 79:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 14 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79:

Ala Gly Val Val Asp Arg Asp Tyr Thr Gly Glu Val Lys Val
1 5 10

(2) INFORMATION FOR SEQ ID NO: 80:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 14 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(v) FRAGMENT TYPE: internal



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 

Ala Gly Val Ile Asp Glu Asp Tyr Arg Gly Asn Val Gly Val
1 5 10

(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 

Thr Gly Leu Ile Asp Pro Gly Phe Gln Gly Glu Leu Lys Leu
1 5 10

(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 
GACGACGACA AGATGCTACT TCCAGACTGG AAA 
(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83:
GGAACAAGAC CCGTCCCACT TTCACAGATG AAGAG 
(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84:
GAGGAGAGCA GGAAAGGTGG AAC 
(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85:
CTCCATGTCC CAACTCCGAT CAC 
(2) INFORMATION FOR SEQ ID NO: 86: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 38 base pairs 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 
GGTTTTCCCA GTCACGACGT TGTAAAACGA CGGCCAGT 
(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 
GGUUUUCCCA GUCACGACGU UGUAAAACGA CGGCCAGU 
(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 
GACGACGACA AGATGCCCTG CTCTGAAGAG ACACC 
(2) INFORMATION FOR SEQ ID NO: 89: 



(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89:



GGAACAAGAC CCGTTTAATT CTTTCCAGTG GAACC 



